Dense arrangements are locally very dense I. 

Jozsef Solymosi* 
February 1, 2008 



Abstract 



The Szemeredi- Trotter theorem |18j gives a bound on the maxi- 
mum number of incidences between points and lines on the Euclidean 
plane. In particular it says that n lines and n points determine 0(re 4//3 ) 
incidences. Let us suppose that an arrangement of n lines and n points 
defines cn 4 / 3 incidences, for a given positive c. It is widely believed that 



such arrangements have special structure, but no results are known 
in this direction. Here we show that for any natural number, k, one 
can find k points of the arrangement in general position such that any 
pair of them is incident to a line from the arrangement, provided by 
n > no (A;). In a subsequent paper we will establish similar statement 
to hyperplanes. 

1 Introduction 

The celebrated Szemeredi- Trotter Theorem JH] states that for n points on 
the plane, the number of m-rich lines cannot exceed 



and this bound is tight in the worst case. This result has numerous appli- 
cations not only in geometry [To! I19j. but also in number theory |13j . The 
Szemeredi- Trotter theorem has various proofs, the most elegant is Szekely's 
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[19]. However the proofs provide very limited insight view of the structure of 
extremal arrangements. It is widely believed that a point-line arrangement 
which defines many incidences has a special, somehow rigid structure. For 
example let us mention here a question of Elekes. Is it true that for every 
c > there is a d > such that if a set of n points on the plane contains 
at least cn 2 collinear triples then at least dn points are along an algebraic 
curve of degree d, where d is an universal constant. 

The main purpose of this paper is to show that any arrangement with 
close to the maximum number of incidences is locally a collection of complete 
geometric graphs. For the sake of simplicity we state the theorem for the 
balanced case, when the number of lines equals to the number of points, but 
it is quite straightforward to see the similar statement for unbalanced cases 
as well. 

Recent work of Gowers j20] and Nagle, Rodl, Schacht, and Skokan 00111 
has established a hypergraph removal lemma, which in turn implies similar 
results to hyperplanes, however a slightly different approach is needed, mainly 
because the higher dimensional extensions of the Szemeredi- Trotter theorem 
are not as well defined as the planar case. To obtain sharp bounds one 
needs certain restrictions on the arrangements. Therefore the corresponding 
structure theorems will appear in subsequent paper. 

A pointset or a set of lines is in general position if no three of the elements 
are collinear or concurrent. 

Theorem 1 For every natural number k and real c > there is a threshold 
n o — no(fc,c) such that if an arrangement of n > no lines and n points 
defines at least cn^ 3 incidences, then one can always find k points of the 
arrangement in general position, such that any pair of them is incident to a 
line from the arrangement. 

As we will see it from the proof, the complete fc-tuple is "local" in a 
sense that for any pair of points, p\ and P2, the number of points from the 
arrangement, incident to the line segment (pi,p2) is less than k. 

2 Proof of Theorem 1 

The main tool of the proof is Szemeredi's Regularity Lemma [§JE]]- We will 

use it's counting lemma form, because it is easier to extend to hypergraphs 
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which we will need for the higher dimensional extensions. Let us prove first 
the simplest case, to show that there is always a triangle. This "simplest 
case" is interesting in it's own right, the statement of Lemma 2 implies Roth's 
theorem [S] about arithmetic progressions on dense subsets of integers. For 
the details we refer to |H] . 

Lemma 2 For every c > there is a threshold n = n (c) and a positive 
5 = 5(c) such that, for any set of n > n lines L and any set of m > cn 2 
points P, if every point is incident to three lines, then there are at least 5n 3 
triangles in the arrangement. (A triangle is a set of three distinct points from 
P such that any two are incident to a line from L.) 

This lemma follows from the following theorem of Ruzsa and Szemeredi 
jH] , which is also called as the triangle removal lemma or the counting lemma 
for triangles. 

Theorem 3 ^ Let G be a graph on n vertices. IfG is the union of cn 2 edge- 
disjoint triangles, then G contains at least 5n 3 triangles, where 5 depends on 
c only. 

The same theorem from a different angle is the following. 

Theorem 4 Let G be a graph on n vertices. If G contains o(n 3 ) triangles, 
then one can remove o(n 2 ) edges to make G triangle-free. 

To prove Lemma 2, let us construct a graph where L is the vertex set, 
and two vertices are adjacent if and only if the corresponding lines cross at 
a point of P. This graph is the union of cn 2 disjoint triangles, every point of 
P defines a unique triangle, so we can apply Theorem 3. 

To determine the number of triangles in any arrangement of lines and 
points seems to be a hard task. A related conjecture of de Caen and Szekely 
|2"T] is that n points and m lines can not determine more than nm triangles. 

One can repeat the same argument, now with k instead of 3. The 
corresponding counting lemma can be proven using Szemeredi's Regularity 
Lemma. The proof is analogous to the Ruzsa-Szemeredi Theorem. There are 
slightly different ways to state the Regularity Lemma, for our purposes the 
so called degree form is convenient. For the notations and proofs we refer to 
the survey paper of Komlos and Simonovits jT] . 
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Theorem 5 (Regularity Lemma) For every e > there is an M = M(e) 
such that if G = (V, E) is any graph and d G (0, 1] is any real number, then 
there is a partition of the vertex-set V into k + 1 clusters Vo, V\, . . . , Vk, and 
there is a subgraph G' C G with the following properties: 

m k<M, 

• \V \ <e\V\, 

• all clusters Vi, i > 1, are of the same size m < \e\V\~\, 

• deg G ,(v) > deg G (v) — (d + e)\V\ for all v G V, 

• e(G\Vi)) = for each i > 1, 

• all pairs G'(Vi, Vj) (1 < i < j < k) are e— regular, each with a density 
either or greater than d. 

Armed with the Regularity Lemma we are ready to prove the following 
statement which is crucial in the proof of Theorem 1. 

Lemma 6 For every c > there is a threshold no = no(c) and a positive 
5 = 5(c) such that, for any set of n > n lines L and any set of m > cn 2 
points P, if every point is incident to k lines, then there are at least Sn k 
complete k— tuples in the arrangement. (A complete k— tuple is a set of k 
distinct lines in general position from L such that any two intersect in a 
point from P.) 

Proof. To avoid having too many degenerate k— tuples, we remove some 
points from P which have many lines incident to them. P', which is the 
subset of P, consists of points incident to at most 100/c lines from L. We 
can apply (1) to see that P' is a large subset of P, say 2\P'\ > \P\. Let us 
construct a graph G where L is the vertex set, and two vertices are adjacent 
if and only if the corresponding lines cross at a point of P'. This graph, G, is 
the union of at least |n 2 edge-disjoint Kk—s. Find a subgraph, G', provided 
by Theorem 5 with e c. In G' we still have some complete Kk—s. Along 
the edges of such a complete graph we have a A;— tuple of V— s such that the 
bipartite graphs between them are dense and regular. This already implies 
the existence of many complete subgraphs, as the following lemma, quoted 
from 1], shows. 
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Lemma 7 Given d > e > 0, a graph R on n vertices, and a positive integer 
m, let us construct a graph G by replacing every vertex of R by m vertices, 
and replacing the edges of R with e— regular pairs of density at least d. Then 
G has at least am n copies of R, where a depends on e, d, and n, but not on 
m. 

Most of the complete A;— vertex subgraphs of graph G' define a complete 
A;— tuple in the arrangement, i.e. the corresponding lines are in general po- 
sition. To see this, let us count the "degenerate" k— tuples, the k— element 
line-sets, where at least one triple is concurrent. The number of concurrent 
triples is at most cn 2 ( 10 ^ c ) < c'n 2 . For every concurrent triple one can select 
k — 3 lines to get a degenerate k— tuple. The expression, c'n fc_1 , is clearly 
an upper bound on the degenerate A;— tuples, therefore most of the com- 
plete graphs on k vertices in G' are complete A;— tuples if n is large enough, 
n > n = n (c). 

The final step of the proof of Theorem 1 is to show that arrangements 
with many incidences always have a substructure where one use Lemma 2. 
We divide the arrangement into smaller parts where we apply the dual of 
Lemma 2. The common technique to do that is the so called cutting, which 
was introduced by Chazelle, see in [TT], about 20 years ago. Here we use a 
more general result, a theorem of Matousek |15j . 

Lemma 8 Let P be a pointset, P C H d , \P\ — n, and let r be a parameter, 
1 C r C n. Then the set P can be partitioned into t sets Ai, A2, • • • , A t , in 
such a way that n/r < |A$| < 2n/r for all i, and any hyperplane crosses no 
more than Oir 1-1 ^) sets, where t — 0(r). 

Here we use the d = 2 case and we choose the value r = /3fcn 2//3 , where 
/3k is a constant, depends on k, which we will specify later. Let us count the 
number of incidences along the lines of L, according to the partition of P. For 
a given line £ G L, we count the sum Yll=i LI D^I/^J' "which is not much 
smaller than the number of incidences on £ over k if £ is rich of incidences, say, 
incident to much more than r 1//2 A; points of P. From the condition of Theorem 
1 and the properties of the partition we have the following inequality. 




£eL i=i 
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Choosing @ k — the inequality becomes 
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Therefore there is an index i, such that 
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If s = 



|Ai|"KI 
k 



then we can partition the points incident to £ into r 

consecutive A;— tuples. We can break the line into r fc— rich line segments and 
consider them as separate lines. Our combinatorial argument in Lemma 6 is 
robust enough to allow such modifications. Then we have some c'n 2 ^ 3 A;— rich 
lines on |Aj| = c^n 1 ^ 3 points. (Another possible way to show that there are 
at least c'n 2 ^ 3 k— rich lines, is to apply the Szemeredi- Trotter theorem, (1), 
to show that most of the lines are not "very rich".) To complete the proof 
of Theorem 1, we apply the dual statement of Lemma 6. 
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